A maximum principle for relaxed stochastic 
control of linear SDEs with application to bond 
portfolio optimization 

Daniel Andersson* and Boualem Djehiche^ 
February 15, 2008 

Abstract 

We study relaxed stochastic control problems where the state equation is 
a one dimensional linear stochastic differential equation with random and 
unbounded coefficients. The two main results are existence of an optimal 
relaxed control and necessary conditions for optimality in the form of a 
relaxed maximum principle. The main motivation is an optimal bond 
portfolio problem in a market where there exists a continuum of bonds 
and the portfolio weights are modeled as measure-valued processes on the 
set of times to maturity. 
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1 Introduction 

The objective of this paper is to derive necessary conditions for optimality in 
relaxed stochastic control problems, i.e. the control is a measure-valued pro- 
cess, where the state process is a solution to a one dimensional linear stochastic 
differential equation (SDE) whose coefficients are random and not necessarily 
bounded. This study is motivated by the following optimal bond portfolio prob- 
lem. Consider a market of non-defaultable bonds, i.e. financial contracts that 
are bought today and pay a fixed amount at some future time, called the ma- 
turity time. At each time t, the investor is allowed to buy bonds with any time 
to maturity in U, where [/ is a subset of Modeling the prices of the bonds 
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as SDEs, we may write down the wealth of the investor as an SDE of the form 
(see Section [5] below) 



Xt=xo+ Xs I {r° - Vs{u)lds) fJ.s{du)ds + Xs Vs{u)ns{du)dBs, 
Jo Ju Jo Ju 

where xq is the investors initial capital and nt is a probability measure on U 
reflecting the proportion invested in bonds with different maturities. Further, 
r° is the short rate, Vt is the integrated volatility process of the bond prices 
and 8t is the so called market price of risk. The objective of the investor is to 
choose fit in some optimal way. Interpreting fit as the control process leads to 
a relaxed control problem where the state process is a linear SDE with random 
coefhcients, and where and Of cannot in general be assumed to be bounded. 

This motivates us to study relaxed stochastic control problems where the state 
equation is a one dimensional linear SDE 

Xt — xq + / / b{s,Xs,u)iis{du)ds + / / a{s, Xs,u)fj.s{du)dBs, 
Jo Ju Jo Ju 

on some filtered probability space (fl, J-', J-t,P) equipped with a d-dimensional 
Brownian motion Bf. The control variable is a process /it taking values in the 
space of probability measures on the action space U. Xq denotes the initial state, 
b and cr are random coefficients of the form 



b{t, x,u,uj) — Vt (u, Lo) + 4>t (u, uj)x, 
a{t,x,u,uj) = Xt(w, w) + i}^t{u,Lo)x, 



for given stochastic processes u, cf), x and ■0 taking values in some space of 
functions on U . The cost functional, which is to be minimized, is of the form 



J(/it) = E( / / h{t,xt,u)fit{du)dt + g{xT) 
u 




Under the usual assumptions on b and a, i.e. deterministic functions of 
{t, X, u), Lipschitz continuous and with linear growth in x, a maximum principle 
for stochastic (strict) control problems where the control enters the diffusion co- 
efficient was established in Peng jlTj . An extension to relaxed control problems 
is given in Bahlali et al. [1] . We refer to Cadenillas and Karatzas [1] for a result 
on stochastic (strict) control, in terms of a maximum principle for linear SDEs 
with bounded random coefficients, under integrability conditions on the control. 

This paper contains two main results. The first one, Theorem 13.11 estab- 
lishes existence of an optimal relaxed control which is derived using a similar 
scheme as in Ma and Yong [8] . The main tools in the proof are tightness and 
Skorohod's Selection Theorem. The second main result. Theorem 14.11 suggests 
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necessary conditions for optimality that are given in form of a relaxed maximum 
principle. The proof is based on the Chattering Lemma, which gives a sequence 
of ordinary (strict) controls that approximates the relaxed control. The proof of 
the maximum principle is based on Zhou's maximum principle [13) for near op- 
timal strict controls, and stability properties of the state- and adjoint processes 
with respect to the control. Note that the relaxed control problems studied 
e.g. in Bahlali et al. [1] is different to ours, in that they relax the corresponding 
infinitesimal generator of the state process, which leads to a martingale problem 
for which the state process is driven by an orthogonal martingale measure. In 
our setting the driving martingale measure ^t{du)dBt is however not orthogonal. 

The paper is organized as follows. In Section 2, we formulate the relaxed 
control problem for our linear SDEs. In Section [3] we prove existence of an 
optimal control, while in SectionlH necessary conditions for optimality are given 
in form of a relaxed maximum principle. In Section 5, we apply these results to 
formulate a maximum principle for an optimal bond portfolio problem. Finally, 
to make the exposition simple, all the proofs and technical details are collected 
in Section 6. 



2 Formulation of the problem 

Consider a one dimensional SDE on some probability space (il, J^, JT^, P): 

Xt=xo+ I b{s,Xs,Us)ds + / a{s,Xs,Us)dBs, (2.1) 
Jo Jo 

where a;o G M is the initial state, _B is a d-dimensional Brownian motion and J-t 
is the P-augmentation of the natural filtration defined by 

J^f =a{B,,se [0, <]) for aU t e [0, T]. 

Furthermore, the coefhcients are given by 

b{t, X, u, Lo) = vt{u, bj) + 4>t{u, lli)x (2.2a) 
a{t, X, u, uj) = xt{u, Lo) + %l^t{u, Lo)x, (2.2b) 

where v : R+ x U x n i-^ R, (j) : M.+ x U x ft i-^ R, X ■ '^'^ 'x U x n i-^ R'^ 
and : M+ x [/ x i— > R'^ are J^t-adapted processes. For each t e [0,T] the 
control ut is in the action space U, a compact set in R". Let U denote the 
class of admissible controls, i.e. J-t-adapted processes with values in U. The 
cost functional is given by 

J(u)=El/ h{t,Xt,Ut)dt + g{xT)] , (2.3) 




and the objective is to minimize J over the set of admissible controls. A control 
u* is called optimal if it satisfies J{u*) = inf{ J(m); u G U}. If also u* e U, it is 
called a strict optimal control. 
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We make the following assumptions regarding the state equation (|2.ip , (|2.2p 
and cost functional (12.31). 



(A.l) ipt{u,uj) is continuous in (t,u), where ip stands for one of the processes 

(A. 2) -0 is uniformly bounded in M+ x U x fl. 
(A. 3) For any k G (— oo, oo) it holds that 









E ^exp 


k / (j)t{u)dt 


^ < oo, for all u e U. 




Jo 





(A. 4) For any fc > 0, it holds that 

E (^J^ \vtiu)\''dtj < oo and E \xt{u)\'"dtj < oo, 

for all u G [/. 

(A. 5) The functions g and h are twice continuously differentiable in x. The 
function g and its first and second derivative are bounded and Lipschitz 
continuous in x. The function h and its first and second derivative are 
bounded, continuous in u and Lipschitz continuous in x. 

Throughout the rest of the paper we will not specify that properties hold P-a.s. 
when it is clear from the context. We denote for any process ipt, 

Iflr^ = sup \ipt\P. 

te[o,T] 

This kind of control problems is often formulated in the so-called relaxed form, 
due to the fact that a strict optimal control may fail to exist (see e.g. Bahlali 
et al. [T] for a discussion). Instead one embeds the strict controls in a wider 
class of controls that takes values in probability measures on U rather than on 
U itself. Also, a solution to a relaxed control problem is a weak one, i.e. the 
probability space, equipped with the a priori given stochastic processes, is part 
of the solution. 



Let ViU) be the space of probability measures on U. If Ht{du) is a stochastic 
process taking values in 'P{U), we denote by £([0, T], U) the space of the (Radon) 
measure- valued processes dXt{u) — iit{du)dt. If a probability space (ri,jr, P) is 
given, then we denote M{^) the space of all JT^-adapted processes fit{du) taking 
values in 'P{U). Further, we denote by X(0) the space of all £([0, T], C/)-valued 
^(-adapted processes. It can be shown that there is a one-to-one correspondence 
between M{Vl) and L{Q), and that £([0, T], U) is a compact metric space. For 
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further discussion, see Ma and Yong [S]. 



Throughout we denote /(/it) — jjj f{u)^,t{du), for any continuous function /. 
By expanding the set of controls from U to M, the state equation is defined as 



Definition 2.1. A relaxed control is the term A = {fl, J-jJ-'tjP, Bt, fit,xt), 
where 

(i) (ri, JF, J-"t, P) is a filtered probability space; 

{ii) Bf = {Bt,Vt,4>t,Xt,ipt), in which Bt is an Tt-Brownian motion and 
vt,4>t,XtT'4't o,fs Tt-adapted stochastic processes satisfying (A.1)-(A.4); 

(m) ^it £ M{n); 

(iv) Xt is J- 1- adapted and satisfies (I2.4p . 

We denote by the set of all relaxed controls. The cost functional correspond- 
ing to the control A is defined as 



and a relaxed control A* is optimal if J{A*) = inf{J(^);^ G lA^}. It is well 
known that U may be embedded into , since any strict ([/-valued) control 
process ut can be represented as a relaxed control by setting /if [du) = Sut (du) . 
Moreover the so-called Chattering Lemma, stated in Section IH tells us that any 
relaxed control is a weak limit of a sequence of strict controls. 

3 Existence of an optimal relaxed control 

In this section we shall establish the existence of an optimal relaxed control. To 
achieve this, we construct a minimizing sequence of controls A'^'^'' G for the 
cost functional J, i.e. 



and show that a limit A exists and fulfills (i) — (if) in Definition 12.11 This 
will be carried out i several steps described in Lemmas 13.1113.31 below, (cf. the 
scheme suggested in Ma and Yong |S]). The main tools are tightness of the 
processes and Skorohod's Selection Theorem. To make the exposition simple, 
we will consider a simpler form the cost functional J, by letting h = 0. The 
proofs can be modified so that the results hold without this restriction. This is 




(2.4) 




(2.5) 



m{{J{A),Ae U'^] = lim J(y^W), 
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done by adding h{xt,u)fit{du)dt as an "extra state" (cf. Yong and Zhou 

Given a relaxed control A — (O, J-t,P, Bt, ^nXt)^ there exists a unique strong 
solution to the equation given by ()2.2p and (|2.4|) . Moreover, its explicit form is 

xt ^ zt[xo+ / as + / dBs , (3.1) 

V Jo Jo J 

where 

zt = exp (^(f>s{fJ.s) - J ■ (^-2) 

This can be verified by applying Ito's formula on p.ip - p.2p . Moreover, with 
the assumptions (A.1)-(A.4) we can prove by standard methods that Xt has the 
following properties: For any p > 1 we have 

E\x\*j^P < oo, (3.3) 

and there exists a constant K > such that 

E{xt-Xs)'^ <K\t~s\'^, (3.4) 

for all s,te [0,T]. 

Lemma 3.1. Given a relaxed control A = J- , Tt, Bt, ^it, Xt) , there exists 
a sequence /ij'^'' £ MiVt) such that for each k, the path ^^^'\A) is continuous 
for all Borel sets A; and 

E\x'^''^ - a;|^'^ ^0, as fc ^ oo, (3.5) 

where a;^*^-* is the solution to (j2.4p with respect to ^^f^^ . Consequently, setting 
= {n,T,^,TuBu^l^^\x'^^^), we get 

as fc — > oo. 

Let A^^^ = (f^W, jP'('^'),P('=),Jf;'''\Bf\/if\a;f^) be the minimizing sequence, 
i.e. 

J(^('=)) ^ \ni{J{A);Ae U"}, as fc ^ oo. 

For this sequence we have the following important property. 

Lemma 3.2. Denote d\^f^\u) — iif^\du)dt. Then the sequence {b[''\ x\''\ x['''') 
is tight in (C([0,T]) x C([0,T] x U)^) x C{[0,T] x U) x C([0,r]). 
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By Lemma 13.11 we may assume that iMj.''^ (A) has continuous paths for each 
Borel set A. By tightness and the Skorohod's Selection Theorem there ex- 
ists a probabihty space {i},T,¥), on which is defined a sequence of processes 

{b[ \ x['^\ x[''^ ) identical in law to {B['^\ x['^\ x[''^ ) and converging P-a.s. to 
{Bt, Xt, xt). Moreover, by Lemma 2.1 in Ma and Yong [8] the processes /tj'^'' 
corresponding to a|'^'* have the same law as since has continuous paths. 
We drop the " ' " in the following. 

Lemma 3.3. Let b and a be the processes defined by (|2.2p . then 

b{t,xt,^lf^) b{t,xt,fit), 
cr(t,a;t,^f ^) a{t,xt,fj.t), 

m L2([0,T] X n), ask^oo. 



Proof. See Ma and Yong [8], Lemma 3.3. ■ 

Again, using Skorohod's Selection Theorem, there exists a limit A of the mini- 
mizing sequence A^^^ which satisfies (i) — {Hi) in Definition 12. II By Lemma [3731 
and the fact that {b'^^\ X^l'\ x^^^'^ ) — » {Bt, Xt, xt) P-a.s., as fc ^ cxd, we prove in 
the next theorem that (iv) also holds: 

Theorem 3.1. The limit xt of x'^^^ satisfies 

Xt = xq + / b{s,Xs, Hs)ds + / (t{s,Xs, fJ.s)dBs. 
Jo Jo 

Therefore, A — {fl, J-,P,J-t, Bt, Ht, Xt) is an optimal relaxed control. 



4 A relaxed maximum principle 

We start by proving the so-called Chattering Lemma, which states that any 
relaxed control may be approximated by strict controls. 

Lemma 4.1. (The Chattering Lemma) 

Let A = {Q, J-t,P, Bt, i-H, Xt) be a relaxed control. Then there exist relaxed 
controls A'^^'^ = {Q, Tt,P, Bt, 6 w,x't''^), A^''^ = {n, T, Tt,¥, Bt, 6 , x[''^) 

and A = {tl, j-, !Ft,^T Bf, fit, Xf), where v!'^^ and u'f^'^ are sequences ofU-valued 
progressively measurable processes defined on {Q,J-,P) and {Cl,J-,¥) respec- 
tively, such that S (k) and (5.(fc) as well as pit o-nd fit are identical in law and 

such that if we denote dxi''\u) = S..(k){du)dt and dXt{u) — jlt{du)dt, then 

Ai'^ - At, (4.1) 
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as fc —> oo P-a.s. in £{[0, T] x U). Moreover, and xj. as well as Xt and xt 
are identical in law and 

E|iW - ^i;,'^ -> 0, (4.2) 

as fc —> oo. 

With the definitions in the Chattering Lemma we thus have 

J(^('^') ^ J(^), (4.3) 

as fc — > oo, and consequently 

inf{ J(^); ^ e U"} = mi{J{u)-u G U}. (4.4) 

The latter equality motivates the use of relaxed control even when one is only 
concerned with strict controls. The strict and the relaxed problems have the 
same optimal value. However, this optimum may not be reached by a strict 
control but only with a measure- valued one which in turn can be approximated 
by strict controls. 



By the Chattering Lemma we can assume that any relaxed control has an ap- 
proximating sequence defined on the same probability space. Let A = 
(r2, J^^ P, !Ft, Bt, fit,xt) be an optimal relaxed control. From now on we let this 
filtered probability space be fixed and vary only the control measures and cor- 
responding state processes. 

4.1 Adjoint processes 

We recall the first- and second order adjoint processes for the state process 
(|2.ip - (j2.2|) . These are two pairs of processes {p,q) and {P,Q) with values in 
M X M'' defined for any strict control u £U. We denote by fx and fxx the first 
and second derivative, respectively, with respect to x of the function /, where 
/ stands for either g or h. Then (p, q) and {P, Q) are given by 

dpt^ -{4>t{ut)pt+'4't{ut)qt + hx{xt,Ut)^dt + qtdBt 
PT = 9x{xt) 

dPf = -(2(l)tiut)Pt+Mut)PtMut)+2Mut)Qt 

+hxx{xt,ut)yt + QtdBt (4-6) 
Pt = gxx{xT) 

The second order adjoint process (|4.6p appears when the control affects the un- 
certainty (noise part) of the system, i.e. when the diffusion coefficient depends 
explicitly on u (cf. Peng [11]). Also, note that the reason for the extra compo- 
nents q and Q is to make it possible to find adapted solutions to the backward 
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SDEs (see Ma and Yong [3] for further discussion). Next, we introduce the 
Hamiltonian of the system: 

H{t,x,a,p,q) = -h{x,a) - p(vt{a) + (j)t{a)x^ - q(xtia) + ipt{a)x^ 

for (t,x,a,p,q) £ [0, T] x M x [/ x M x . Further, we define the H-function 
corresponding to a given strict control ut and its corresponding state process Xt 

by 

X, a) ^H^t, X, a,pt,qt - Pt{xt{ut) + Mut)xt)) 

- +'^t{a)x^Pt[xt{a) + il^t{a)x^ 

for (t, x, a) G [0, T] X R X [/, where pt, Pt and qt are determined by the adjoint 
equations (|4.5p and (|4.6p . The next proposition relates the 7i- function to the 
cost functional. It was first proved in Peng [llj for the case of general SDEs 
with bounded coefhcients. 

Proposition 4.1. Let u ^lA with state process Xt he given. Denote by the 
perturbed control: 

V for t £ [r, T + 6] 
Ut otherwise. 

Then there exists two pairs of processes {pt,qt), {Pt,Qt) which solve (|4.5p - (|4.6p . 
such that 



E|p|;,'P+E r \q\Pdt<oo, 
Jo 

E|P|;,^^' + E / \Q\Pdt <oo, 
Jq 



for any p > 1, and such that the following holds. 

J{5uo) - J{Su) = E^ xt,ut) - xt,4)) dt + o{e). 

4.2 Necessary conditions for near optimality 

By Proposition 14.11 we can derive the following necessary condition for a 'near- 
optimal' strict control in terms of the 7^-function. 

Proposition 4.2. Let Ut be a strict control such that 

J{5u) < J{(i) + e, 

then there exists constants K > and 7 > such that the following inequality 
holds 

nT nT 

E n^''*^'''\t,xt,ut)dt >supE n^'='-'''\t,xt,a)dt- Ke'^. (4.7) 



aeu Jo 
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4.3 The relaxed maximum principle 



We define the adjoint equations corresponding to a relaxed control fit as in (|4.5p 
and (|4.6p with Lpt{ut) replaced by (ftifJ^t) = Ju Vt{u)d^t{u) where ipt is any of 

dpt= -(4>t{p-t)pt+'>Pt{p-t)qt + K{xt,fJ.t)^dt + qtdBt ^^g^ 
PT = 9x{xt) 



dPt = -{2(i)t{fit)Pt + iH{f^t)Pti^M + 'ii^MQt 

+Kx{xt,fit))dt + QtdBt (4-9) 
Pt = gxxixr) 

The 7i-function associated with fit is defined analogously; 

^(""^'^(t, X, u) =H(t, X, u,pt, qt - Ptixtipt) + Mpt)xt)) 

- \[xtiu) + Mu)x)Pt{xt{u) + My')x), (4.10) 

for {t, x, u) e [0, T] X M X U. 

Finally, the function with u G V{U) as the control variable is just (|4.10p 
integrated with respect to z/: 

X, ly) =H{t, X, iy,pt, qt - Pt{xt{fit) + i^Mxt)) 

- \ {xt{i^) + M^)^) Pt (xt{y) + , (4.11) 

for {t,x,v) e [0,T] X R X V{U). 

The following result states that the integrated 7i-function associated with the 
optimal relaxed control fi is the limit of the integrated 7i-function associated 
with the approximating strict control sequence u^^\ 

(k) 

Lemma 4.2. Let x). be the state process corresponding to the control sequence 
u''^'-' given by the Chattering Lemma, then it holds that 



hm E 



n 



n^^^*'(''\t,xt,h)dt. 



The main result of this section is the following maximum principle. 

Theorem 4.1. [Relaxed P entry agin' s Maximum Principle) 

If fit is an optimal relaxed control with state process Xt, then for any t outside 

a null set 

n^^''^''\t,xt,fit) = sup n^^"^'*\t,xt,iy), F-a.s. 

v£V{U) 
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Using a similar proof as e.g. Corollaries 4.8 and 4.10 in Bahlali et al. [T], Theorem 
14. H is derived from the following integrated maximum principle. 

Proposition 4.3. {Integrated Maximum Principle) 

If fit is an optimal relaxed control with state process Xt, then 

E H^^*^'^'^t,xufit)dt = supE H^^''^'\t,xt,u)dt. 
Jo y-eu Jq 

Proof. By the Chattering Lemma there exists an approximating strict control 
sequence u[''^ and a corresponding sequence of real numbers e'-'^-' —^ such that 

Sending /c — > oo and using Proposition |421 and Lemma |4. 21 completes the proof. 



5 An optimal bond portfolio problem 

We recall the basic Heath- Jarrow- Morton setup, see Bjork [2]. Given a filtered 
probability space {il,T,!Ft,P) carrying a d-dimensional .Ft-Brownian motion 
Bt, the forward rate ft{T), for each fixed r, follows an SDE 

dftiT)=at{T)dt + dt{T)dBt, (5.1) 

where atir) and dtir) are M- and R''- valued adapted processes respectively, and 
T denotes time of maturity for the zero coupon bond. 

The bond market induced by the forward rates (|5.ip is free of arbitrage, in 
the sense that there exists an equivalent martingale measure, if and only if a 
can be represented as 

at{T) = at{T) j ^ dt{s)ds - dt{T)Qt, 

where <dt is an adapted process such that the Dolean's exponential i?(/g QsdBg) 
is a P-martingale. The process is known as the market price of risk. 

Consider a market of zero coupon bonds with times to maturity in the 
interval U = [0,r*]. The forward interest rate under this so-called Musiela 
parametrization is given by 

rt{u) = ft{t + u), ue U, 

e.g. = rt(0) denotes the short rate at time t. The re-parametrization yields 

drt{u) = (^^'"t('") + '^t('") j ^t{x)dx - at{u)Qt^ dt -f at{u)dBt, (5.2) 
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where Ut (u) = (yt{t + u) . The relation between bond prices and forward interest 
rates is given by 



Pt{u) exp l^-y rt{x)dxj , (5.3) 

and thus by applying Ito's formula on (jS.Sp and inserting Eq. (j5.2p . one can 
express the dynamics of the bond prices as 

dpt{u) = pt{u) (r° - rt{u) - vt{u)Qt) dt + pt{u)vt{u)dBt, (5.4) 

where vt{ ) — ~ Jq <Jt{x)dx. 

Investing in bonds with the price dynamics as above gives the opportunity 
to, at any time, choose among a continuum of assets. Namely one for each 
maturity u G U. This gives rise to the problem of how to define a portfolio. A 
reasonable choice is to consider measure-valued portfolios, as is done in Bjork 
et al. [3] . Using measure- valued portfolios also ensures the existence of a locally 

risk free bank account, bt = exp ^ r'^ds^ , since this investment is equivalent 

to a so called roll-over strategy, see e.g. Bjork [2]. This strategy is performed 
by continuously reinvesting the entire portfolio value in the just maturing bond, 
and over an arbitrary time interval uses an infinite number of assets. 

We define a portfolio as a measure- valued process pt{du), u € U. Intuitively 
Pt{du) is the "number" of bonds in our portfolio at time t, with time to maturity 
in the infinitesimal time interval \u,u + du]. We denote by Xt the value of the 
portfolio at time i, i.e. 



Xt ^ pt{u)pt{du). 
Ju 

Further, the portfolio is self financing, i.e. the increments of the portfolio value 
are due to price changes only. Referring to Ekeland and Tallin [5], we may 
formally express this as 

dxt = j pt{du) (^dpt{u) - -^pt{u)dt 

where the last term appears because we use the Musiela parametrization. In this 
setting the portfolio value changes due to both price changes as well as to changes 
in time to maturity. Using Eq. (|5.4p and noting that -^pt{u) = —pt{u)rt{u), 
the above relation is interpreted as follows (see e.g. Bjork et al. [3])- 

Xt ^xq + / / ps{u) (r" - rs{u) - Vs{u)Qs) Ps{du)ds 
Jo Ju 

/ ps{u)vs{u)ps{du)dBs + / / ps{u)rs{u)ps{du)ds 
lo Ju Jo Ju 

^xq + / ps{u) (r^ - Vs{u)lds) Ps{du)ds + / / Ps{u)vs{u)ps{du)dBs, 
Jo Ju Jo Ju 
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where xq is the initial capital. Considering only portfolios with positive holdings 
(i.e. no short positions), we may write 

pt{du) = ——fit{du), (5.5) 
Pt[u) 

where /it £ M{il), i.e. it takes values in the set of probability measures on U. 
This so-called relative portfolio is the proportion of the portfolio invested in 
bonds with time to maturity in [u, u + du]. 

Now, inserting the expression (|5.5p into the portfolio dynamics above yields 

Xt^XQ+ I Xs I {r° - Vs{u)Qs) IJ's{du)ds + Xs Vs{u)pLs{du)dBs. (5.6) 
Jo Ju Jo Ju 



The aim is to control this self-financing portfolio, via fit, in an optimal way. 
Assuming that our goal is to minimize the cost functional 

J{fi) ^¥.^J J h{t,xt,u)fit{du)dt + g{xT)^ , 

we get an optimal control problem on the form (|2.2p . (|2.4p . ()2.5p . with vt — Xt = 
0, 0t(M) = (r^ - vt{u)Qt) and V'tC") = vt{u). 



Example 5.1. Consider a passive investor who invests the initial capital into N 
number of bonds with times of maturity Ti, . . . , T/v, at time and does nothing 
thereafter, i.e. 

N 
i=l 

where Pt{T) denotes the price of a bond with time of maturity T . This corre- 
sponds to a portfolio consisting of bonds which, at time t, have times to maturity 
Ti — t, . . . , Tn — t. Thus, our relative portfolio is 

N 

lj,t{du) = ^(5(T,-t)(dw), 
1=1 

and Eq. (|5.6[) becomes 

N t N .t N 

a;t=^Po(Ti)+ / Xs'^irl - Vs{Ti - s)Qs) ds + / Xs VsiTj - s)dBs. 

^=l ^=l »=1 

Note that by the Musiela parametrization we are in a moving time frame and 
therefore, although the investor is passive, the control measure changes contin- 
uously in t. 
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5.1 Mean variance portfolio selection 

In this last section we derive the adjoint equations and 7i-function for a specific 
example. The cost functional corresponds to a mean variance portfolio selection 
problem and two different choices of interest rate processes are considered. In 
principle, necessary conditions for a portfolio to be optimal can be found by 
maximizing the ?i-function with respect to a measure on U. Unfortunately, 
the BSDEs for (p, q) and (P, Q) are quite involved and it seems difficult to find 
explicit solutions. 

Assume the following cost functional: 

J(Ai) =eQ(xt , (5.7) 

with given constant k. Minimizing J (for a certain k) is equivalent to a mean 
variance portfolio selection problem. Assume that (it with corresponding port- 
folio value Xt is optimal. Using the relaxed maximum principle we may write 
down the necessary conditions for fit and Xt- The adjoint equations becomes 

f dpt = -(^{r'^ -Vt{fit)Ot)pt+Vt{p.t)qt^dt + qtdBt, 



Pt ^ XT - K, 



and 

f dPt 



-((2(rO - vtiMQt) + {vt{Mf)Pt + 2vt{fit)Qt)dt + QtdBt, 
[ Pt= 1- 
The corresponding H-function is 

H^^*^^'Ht,Xt,i^) ^-pt(r1 -vt{i^)et)xt - [qt-PtVt{fit)xt)vt{iy)xt 

-^Ptv^^jxl (5.8) 

where again, vt{v) = ^jjVt{u)v(du) and vf(v) = jjjv1{u)v{du). 

5.1.1 Ho-Lee 

Choosing the volatility process to be constant, 

at{u) = cr, 

and consequently 

vt{u) = -au, 

the short rate is a Gaussian process. Under a obvious integrability assumption 
on Qt we then have that (A.1)-(A.3) are fulfilled. Thus by the relaxed maximum 
principle, a necessary condition for a portfolio (it to minimize the cost functional 
(|5.7p . is that it maximizes Eq. (|5.8p with 



Vt{v) — — cr / uv{du). 



u 
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5.1.2 Hull- White 

Another choice of volatility process that induces a mean-reverting Gaussian 
short rate is 

at{u) = ere"™, 

with constants a and c. Thus, a necessary condition for optimality of a portfolio 
fit is that it maximizes Eq. (|5.8p with 

VtH = - [ (e-^" - iHdu). 
c Ju 

6 Proofs and technical results 

Throughout this section, we denote hy K > a generic constant that may vary 
from line to line. 



Proof of Lemma 13.11 The second assertion follows easily from the first by 
using the Lipschitz property of g. For the first assertion, define (pointwise in 
r2) for = 1, 2, . . . and A a Borel set; 



1 f* 



Obviously, for each k, ^[''\A) is continuous for all Borel sets A. Moreover, one 
can show (cf. Ma and Yong [8j) that for any / G C(f7, M") the following holds 
for {t, Lu) outside a null set 

/(/if "*) ^ fif^t), ask-^oo. 

In particular, 

vt{^J■['''') vt{^J.t), (6.1) 

P-a.s. for t outside a null set, where cp stands for one of the processes v, </>, x, ip. 
Denote yt = Xj'^'' — Xt, then yt can be expressed as 



yt 



+ ^xs{^^i'^)-xs{^^s) + {M^^i'^)-M^^s))xi''^)dBs+ f' M^^s)ysdB,. 



Let 



Zt = 1 - / (ps{pi.s)zsds, 
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and apply Ito's formula on ztyt to get 



I Zgds 



iJs{iis)zsysdBs. 



Noting that is bounded, we apply the Burkholder-Davis-Gundy, Gronwall and 
Holder inequalities to get that for any p > 1, 

+ e£ I xs (mI"' )~xs{t^s)\ ^^dt + (e I (^1'=) )~i^s{^Jis)\'^dt)'^Y 

where all the terms on the right hand side converge to as fc — > oo by the 
Dominated Convergence Theorem. We note that by (A. 3) we have E|z|^''' < oo, 
E|z~^|^'^ < oo, for any p > 1. Thus, using the Holder inequality we conclude 
that 

1/2 



E|a;W - x\*/ = ¥.\z-^zy\*/ < K{¥.\zy\*/^ ^ 0, 



as fc ^ oo. 



Proof of Lemma l3.2l It suffices to check that the marginals are tight. -B^'^^ is 
tight since the processes induce the same measure for every k. Further, Aj*"^ is 
tight because £([0, T] x U) is compact. Finally by p.4p . there exists a constant 
K > Q such that 

EW(a;(")-a;W)4<i^|s-i|2, 

for all s G [0, T], for all fc, where E^*^^ is the expectation under P^'^^. Hence the 
Kolmogorov condition is fulfilled (see e.g. Yong and Zhou [12], Theorem 2.14.) 
and xj'^'' is tight. ■ 

Proof of Theorem 13.11 Define the set ^ = | Ms); ^■(■s, ^s, Ms)) : Ms S 

M(f7)|. Then /C is a convex set in ^^([o^T] x fi) x L'^{[Q,T] x n) and by 
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Mazur's Theorem the weak closure of K, equals its strong closure. Thus, by 
Lemma [3731 for each integer I > and e > 0, there exists a finite set of numbers 
{«!, . . . , Q;jv(i,e)} satisfying ai > and = 1, such that 

E/ oiMt,xuixl^'^)-h(t,xuiXt)ydt (6.2) 

•^0 j=l 



+ IE / I ^ ai(T(t,Xt,/iJ''^''') - cr(i, a;f,^t)|"(it < e. (6.3) 
■^0 i=l 

By uniform integrability, for any e > 0, there exists an integer iVo(e) > 0, such 
that 

I*. 4 I (u\ I*, 4 

It + r ~At 



E(|xW-a;|;''+|i3('^)-B| 



for all A; > Nq and u E U. Fix such an e and let N = N{No, e) and {ai, . . . , a^} 
be such that a, > 0; J2i cti — ^ and (|6.3p holds. Denote by isT a generic constant 
that may vary from line to line. Define for each k — 1,2, . . ., 

b^'''>{t,xt,^it)^v[''\^lt) + 4''\^lt)xt, 

a^'\t,xt,^it)^xi'\fit)+4'\^^t)xt, 
and for each i — 1, . . . , N 





Then it is readily seen that 

N 



E|^a,A'(S)|;' <Xe. 

i 

Similarly, let 

A\x)t = f\^''"+'Hs,xi^'>+'\fii'^°+'^)ds- f\^''°+'Hs,Xs,fii''"+'^)ds, 
Jo Jo 

A'{x)^ = /*a(^°+'')(s,x(^«+'),Aif"+*))di3, - /*a(^«+^)(s,x„/if«+^^)dSs, 
Jo Jo 

A\b)t^ r6(^»+')(s,x„/ii.^«+^))ds- f bis,xs,^i^^°+'^)ds, 
Jo Jo 
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and conclude that 

N N N N 



E 



i i i i 

< Ke, 



where we have used the Burkholder-Davis-Gundy inequahty for the martingale 
terms. 

Note that for each k, xf'^ satisfies 

Jo Jo 

and thus 

JV 



QiX^''^" = x+ / b{s,Xs, ii)ds + / (j{s,Xs,lj)dBa 

iV JV JV TV JV 

+ ^ A\B), + J2 A'ixt + J2 ^\^)t + E ^^(^)* + E ^'(^)* 

i i i i i 

+ (^E"*^(*'^^'^^^°'^''') ~^(^'^^''^^))^'^^• 
Since f{x) = x^ is convex, it is easy to check that 



Combining this with the previous inequalities and using the Burkholder-Davis- 
Gundy inequality, yields 

E(^\xt-xo- J b{s,Xs,lj)ds - j a{s,Xs,ij)dBs\*^^^ 
<K\e + E / \y2aib{s,Xs,ni^'>+'^)-bis,Xs,n)\^dt 

+ E / \y2c(^<^{s,Xs,^li^"^"^) - <j{s,Xs,n)\^dt\ 



<Ke. 

This completes the proof. 
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Proof of Lemma 14.11 The construction of the approximating sequence u^^'^ 
is done as in Ma and Yong [8j, Theorem 3.6. Then by tightness and Skorohod's 
Selection Theorem S.w{du)dt fit{du)dt P-a.s. in C{[0,T] x U) and g^) 
follows in the same way as in Theorem 13. II ■ 



Proof of Proposition 14.11 Let denote the state process corresponding 
to . We proceed as in Peng [Tl| to introduce the first- and second order 
variational equations (noting that bx — (t>, cr^ = ip, hxx = (^xx = in our case): 



ft 





(t)s{Us)x''^'^^ + Vs{ul) + (t)s{ul)Xs - Vs{Us) ~ (t>s{Us)x^ds 
{%l}s[Us)x'-g^ + Xs{ul) + ll)s{ul)Xs - Xs{Us) - 1ps{Us)Xs^dBs, 



+ ^ + {MO - M^s))xi'^'^ 

Then we have the following estimate. 

E\x' -x-x^^') -x^^'^l*/ <Ke\ (6.4) 
To prove (|6.4p . note that for any p > 1 it holds that 

Elx'^^^l*/'' < KQP (6.5) 

and 

As in Peng [11] , we can write 

xt + x[^^ + a;f ^ =xo + (t^s(uf ) + </-«(«!) (a;. + 4'^ + x^^^))ds 

+ f (xs{u'^ + MOi^s + + xfy))dB, 
t 



- I GUs- I A^dBs. 



where 



A = 
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We have for and A® that 



GUs 



*,2 
T 



KdBs 



(6.7) 



Thus, 



GUs+ I KdBs, 



which together with (HH) and leads to ([13) ■ With this result, a 

Taylor expansion of the cost functional as in Peng [TP gives us 

J{5^e)-J{5u) 



+E / (/i(a;„uf)-/i(a:„u,))ds + E(.g,(xT)(4'^ +4'^)) (6-8) 



The next step is to express the right hand side in terms of the first- and second 
order adjoint processes. We start by deriving the former. Define 

JQ Jo 

By Ito's formula is given by 

$-1 = 1 + / (AMMus) - Mus))K^ds - [ i^s{us)K^dBs. 



(6.9) 



By a simple manipulation we deduce the moment property 

E\'S>\*^P +E\^-^\*^P < oo, 
for any p > 1. Next, we introduce 

^^Tgx{xT)+ ^thx{xt,Ut)dt, 
Jo 

pt 
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Since and are bounded we can use (|6.9p to deduce that 

E|x(i)|''<oo, 

for any p > I. Thus by the Martingale Representation Theorem there exists an 
^(-adapted process Ht with the property that, for any p > I, 

E [ \Ht\Pdt < oo, 
Jo 

and such that 

Jo Jo 

We may now define our first order adjoint processes {p, q) as 

qt = 'i>t^Ht-Mut)pt, 
noting that pt and qt are R- resp. R''-valued JFj-adapted processes satisfying 



E\p\*j^P+E [ \q\Pdt <oo, 
Jo 



for any p > 1. Applying Ito's formula on pt = $t ^Vt^^ yields 

dpt = -{hx{xt,ut) + pt(l)t{ut) + qti>t{ut)^dt + qtdBt. (6.10) 
Using (|6.10p . and once again by using Ito's formula we can derive 

'e(^Ptx':P^ =E j (^pt{vt{ul) + (j)t{ut)xt - vt{ut) - (j)tiut)xt) 

+ qt{xt{uf) + %l)t{ul)xt - Xt{ut) - TptMxt) - hj:{xt,ut)x[^^^dt. 

Similarly, 

/■T 

=E / +'?t(V^t("?) -^tK))a;i'^ -/i..(a;t,"t)a^P^)rf^- 

(6.11) 

Now, noting that px ~ ^^^^rgxixr) — 9x{xt) and that the first two terms on 
the right hand side of (|6.1ip is of order 9^/"^ , we may combine the two equalities 
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E 



above to 

E j qt(xt{ut) + 'ipt{ul)xt ~ Xt{ut) ~ ^t{ut)xt 

{K(xuut){x^^^ +xf^))dt + o{e). 
^ ^ 

Thus, we may write the right hand side of (|6.8p as 

E j (^H{t,xt,ut,pt,qt) - H{t,xt,uf,pt,qt)^dt+^E J hxx{xt,Ut)x[^^ x^t^^ dt 



2 

It remains to replace the second order terms with the second order adjoint 
processes. Define 

^t = l+ f {2(l>s{Us)+1ps{Us)i's{Us))'i^sds+ [ 2^Js{Us)^sdBs. 

Jo Jo 
Thus, \E'~^ is given by 

^T' = 1 + / (3V'sK)V'sK) - 20,K))*7ids - / 2^s{us)'f-^dBs, 
Jo Jo 

and as above the moment property 

E|$i;,'p + E|$-i|;,'^ < oo, 

holds for any p > 1. Next, we introduce 

= -^Tgxxixr) + / 1'tKx{xt,Ut)dt, 
Jo 



yl^> ^E{X^^^\Tt) ~ / ^shUxs,Us)ds 



where gxx and h^x are bounded so that 

E|x(2)|f 







< oo, 



for any p > 1. Again, by the Martingale Representation Theorem there exists 
an J-"f-adapted process Kt with the property that, for any p > I, 



E / \Kt\Pdt < oo, 
Jo 
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and such that 



lo Jo 
The second order adjoint processes {P, Q) is defined as 

noting that Ft and Qt are E- resp. R''-valued JF^-adapted processes satisfying 

E|P|;,'^' + e/ \Q\Pdt <oo, 
Ja 

for any p > 1. Applying Ito's formula on Pt ~ "^^^yf^ yields 

dPt = -(^h^Axt,ut) + 2PtMut)+PtMut)Mut) + 2QtMut))dt + QtdBt. 
By another application of Ito's formula we deduce 



=E 



T 

(2Pt{b{xt,u'l) - b{xt,ut) +Tpt{ut){cr{xt,u'l) ~ a{{xt,Ut))x''^^ 

+ 2Qt{cf{xt,u1) - a{{xt,ut))x^^'' - h^x{xt,ut)x[^^ x'i^^ (6-12) 
+ {a{xt,Ut) - a{xt,ut))Pt{(j{xt,ul) - a{xt,Ut))^dt. 

Noting that Pt = ^^^^rgxxixx) = Oxxixx) and that the first two terms on 
the right hand side of (|6.12p is of order O^^"^, we get 

T 







^■'(^gxx{xT){x^T''^T^)) =^ I {(7{xt,u1) - a{xt,Ut))Pt{cr{xt,uf) ~ a{xt,ut))dt 

{hxxixt,ut)x';}^x[^''^dt + o{9). 





E 



Hence we can remove the second order terms on the right hand side of (|6.8p to 
get 

E j (^H{t,xt,ut,pt,qt) ~ H{t,Xt,Ut,pt,qt)^dt 

1 

+ -E / {aixt,uf)-a{xt,ut))Pt{<7{xt,u^t)-'^i^t,ut))dt + o{e) 



E 







xt,ut) - (i, XtA)) dt + o{e), 



which completes the proof. 
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Before we can prove Proposition [321 we need some preliminary results. We 
start by defining a metric on lA: 

d{u,u') =F®dt|(w,i) e r2 X [(},T];u{uj,t) ^it'(tj,t)|, (6.13) 

where P eg) is the product measure of P and the Lebesgue measure. Then we 
have the following result. 

Lemma 6.1. (i) {U,d) is a complete metric space. 

(ii) For any p > 1 there exists a constant K such that for any u,u' S U, it 
holds that 

^x-x'\*/^ < K{d{u,u')Y'^, (6.14) 
where Xt and x'l are the state processes corresponding to u and u' . 
(Hi) For any u,u' CzU there exists a constant K such that 

\J{5u) - J{5u')\ < K{d{u,u')f'\ (6.15) 

Proof, {i) can be proved as in Elliott and Kohlmann [7|, see also Zhou [13]. 
As for (m), denoting yt = Xt — x'^, we have that yt satisfies the SDE 



yt 



Let 

Z( = 1 - / (j)s{u'^)zsds, 
Jo 

and apply Ito's formula on ztyt to get 

ZtVt = J (^VsiUs) - l^s{u's) + {4>s{Us) - (f>t{u's))xs^ 
+ I {Xs{us) - XsK) + {'>Ps{Us) - 1ps{u'^))xs^ 

Zsys'ipsiu's)dBs 



(vs{us) - Vsiu's) + {(l)s{us) - (l)tiu'^))xs^ds + J (l)s{u'^)ysds 



Zgds 



{vs{Us) - Vs{u'^) + {(t>s{Us) - (t>t{u'g))x^lf^.^^^^,^}Zsds 
+ I {Xs{us) - Xs{u's) + {lps{Us) - iJs{u's))xsY{u,^u'jZsdl 

Zsysi>siu's)dBs- 



24 



Then, using the Burkholder-Davis-Gundy and Holder inequahties 

Thus, by Gronwalls inequahty, 

¥.\zy\*/'' < K{d{u,u')Y''' . 

Finally, we conclude that 

E|a; - x'\*/P = E|z-iz2/|*'2p < K{^zy\*/Pf'^ < K{d{u,u')f'^ , 
proving [ii). 

Next, using the Lipschitz continuity of h and g, we get 
\J{5u) - J{5u')\ 

<E ^ (\h{xt, ut) - h{x't, ut)\ + \h{x't, ut) - Hx't, ui)\yt + E\g{xT) - gix'T)\ 

By {ii) and the boundedness of h we conclude that there exists K > such that 

\J{Su)-J{Su')\<K{d{u,u')Y^\ 



Lemma 6.2. Let u,u' GU along with the corresponding state processes xt,x^ 
and adjoint processes {pt,<lt),{Pt,Qt),{PtJ<lt)'{Pt'Q't) given. Then it holds 
that 

E 



^ (\pt-p't? + \qt-q'tf)dt<K{d{u,u')Y'\ (6.16) 
- P,f + IQt - Q[\^)dt < K{d{u,u')f'\ (6.17) 



E 



for some constant K > Q. 

Proof. Let {pt, qt) = {pt — p't^lt ~ 9t)> which then satisfies the BSDE 

dpt = - {(bi (u'i)pi + ■4't{u't)qt + ^tjdt + qtdBt 
PT= gxixr) - gxix'r), 
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where 
Further, let 



dB, 



where sgw{qt) = (sgn((jj ), . . . , sgn(gj')) for the d-dimensional vector qt- Now, by 
defining 



Zf = 1 - / (t)s{Us)Zsds, 







and applying Ito's formula on ztyt, we get 



t rt 



n{ps)ds+ / (^ZsUsipsius) + Zs\qs\sgn{qs)^dBs 



Ztyt = / Zs\ps\sg: 

Jo Ja 

Note that E|z~-'^|^''' < oo for any p > 1 by (A. 3). Since ipt is bounded, the 
Burkholder-Davis-Gundy and Gronwall inequalities yields 

E\zy\*/P < KeJ^ Iztl'^ilPtl'" + \qt\''')dt, 
for any p (zN. We can conclude using the Cauchy-Schwarz inequality that 

/ fT \ 1/2 

^E\z-'zy\lP <k(eJ^ \zt\^p{\pt\^P + \qt\^Pyt) <oo, 

for any p €N. Moreover, by applying Ito's formula on ptyt and taking expecta- 
tions, we obtain 

E I lpf\pt\sm(pt) + qt\qf\sm(Qf)]dt 



(pt\pt\sgn{pt) + qt\qt\sgn{qt)y 
=e(^^ ^tytdt+{g^{xT)-9Ax'T))yT^ (6.18) 
<k\^(eJ^ l^tfdtY +(e|5,(xt)-5x(^t)I')'^'}(e|2/It')'^'- 



Using that the left hand side of (|6.18p is equal to E (|PtP + \qt\'^)dt, we 
conclude that 



E 



1/2 
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As for the first term on the right hand side, we have by the Lipschitz continuity 
of hr that 



E 
<KE 
<KE 



\hx{xt,ut) - hx{x[,u[)\^dt 

T 



\K{xt,Ut) - K{x[,ut)\'^ + \K{x[,ut) - hx{xt,u't)\'^jdt 
(\xt - a;;p + \h,{x't,ut) - h,{x't,u't)\%,,^^,,}yt < K{d{u,u')) 



1/4 



Moreover, by the estimate 



y{(t>t{ut) - <l)t{u[))pt + (iptiut) -'tpt{u't))qt Jdt 

/ 2 2 

y (Mut) ~ (t)t{ut))pt I{u#u'} + [i'tiut] - A{u't))qt I{u#u'} 



dt 



<K{dMY'\ 
we deduce 



E 



\U^dt] <K{d{u,u')) 



1/8 



Finally, by the Lipschitz continuity of gx , we have the stimate 

which proves (j6.16p . Obviously, the same arguments prove (j6.17p . ■ 

Proof of Proposition I4.2i Recall that by Lemma I6.1[ J : {U, d) i— > M is 
continuous. Thus, by Ekeland's variational principle, see Ekeland [5] or Zhou 
|13j . we can find a strict control ut such that 



d{u,u) < 6^/3 



and 



J (Si) < J{5u) + e^'^d{u,u), 
for every u U. Next, we consider the perturbed control u^: 



V for t G [r, T 
Ut otherwise. 



Thus, ()6.19|) and the fact that d{u^ , u) < 9 implies 



(6.19) 
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By Lemma 14.11 the left hand side is equal to 

E / (H^^''^*Ht,£t,ut) - H^^"^'\t,it,u1)yt + o{0). 



Since 



E 



r+fl 





E 



and differs from u only on [r, t + 6*] , we get 

(rL^^''^'\t,xt,ut)-'H^^''^'\t,it,v)^dt + o{e) > -t^/^e. 

If we divide by 9 and let 0^0, this yields 

¥.{n^'^-''~'-\T,ir,Ur)-'H^^-^^-\T,Xr,v)^ > (6.20) 

The next step is to replace u with u, i.e we want to estimate 

xt, ut) - (6.21) 
We do this term by term. With h and a as in (|2.2p . we may write p.2ip as 

--E y (^a{t,xt,Uf) ~ a{t,xt,ut)jPt(^a{t,xt,Uf) - a(t,xt,ut)^dt 

+E y qt(b{t,Xt,Ut) - b{t,xt,ut)jdt ~E J qt(b{t,Xt,uf) - b{t,Xt,Ut)jdt 

+E y Pt(a{t,xt,Ut) - a{t,xt,ut)^dt -E J Pt(a{t,xt,uf) - a{t,xt,ut)^dt 

+E / ^/i(i, it, {t( ) — /i(t, it, Ut)^ (it — E / (ji(t,xt,u^) — h(t,xt,ut)jdt. 

(6.22) 
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We start by estimating the third hnc in the above expression, 
Jo 

=E / [dt - QtM b(t,xt,u'!) - bit, Xt,Uf) ]dt 



qt (b{t,Xt,Ut) - b{t, xt,ut)jdt 

J (^qt - qtj(b{t,Xt,Ut) - b{t,xt,ut) 

qt(b{t,Xt,Ut) - Kt^^t,u^)^dt + E / qt(b{t,xt,ut) - b(t,xt,ut))dt 



=h+h+h- 

We have the foUowing estimate of the first term, using Lemma 16.21 and the 
integrabihty of the components of •, •). 

1/2 / i-T ^ 1/2 

E/ \b{t,5:t,u1) ~b{t,it.ut)\^dt 



/i <(^E^ \qt~qt?dt^ (e 

As for the second term, we get using Lemma |6. II 

/2<(^IE^ \qt?dt^ ' (eJ^ \M4)[it - xt]\^^ ' 



<k[e 

V Jo 



T X 1/4 



\Xt 



Further, 



/a = E y qt{b(t,xt.,ut) -b{t,xt,ut)^dt ^-E J qt(b{t,Xt,Ut) ~ b{t,xt,ut)jdt 
< (^E^ (eJ^ \i^tiut) - Mut) + [Mut) - Mut))it\\u^u}d?j 

<X(ei/6 + el/24). 
We can conclude that 



E 







qt(b{t,Xt,Ut) - b{t,xt,ut)jdt -E / qt(b{t,Xt,uf) - b{t,Xt,Ut)jdt 



By similar calculations we are able to get the same estimates for the other terms 
in (|6.22[) . Combining this with (|6.20p . and since w G C/ is arbitrary, (|4.7p follows. 
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Proof of Lemma HH Let (p^'^), gW), (PC^), QC^)) and {p,q),{P,Q) be the 
processes given by (j4.5p and (|4.6p corresponding to x^''^ and a;^ respectively. 
Define 

zt = l- MMzsds, and ' = 1 - / </),(uW)z('=)ds. 
Jo JO 

Further, we denote {pt,qt) — {ztPt ~ z'l^pi\ztqt — z^l^^qi^^, and simlarly for 
the second order adjoint processes (with replaced by 20 above). The proof is 
carried out in three steps: 

{%) linifc^oo E (v\ + \qt?)dt = 0, 
ill) limfe^oo E/J^ (p2 + |g,|2)dt = 0, 

(wi ) linifc^oo E \l H^^'" , mI''^ )dt = E /^f H^** ■'^'^ , ^ * , A* 

We first prove (i). By applying Itos formula on p\ we have 
p\ + y Igsptis = {zTgxixT) - zPgx{x'^^)) 

where 

$4 =ipt{fj.t)ztqt + ZtK{xt, fit) 

Taking expectations and using Young's inequality on the second term on the 
right hand side yields 



ip^t+^ I lit 



fds<E(^ZT9x{xT)-zPg^{xP)j +a^EJ p^Js 
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Expanding the last term: 

i-T 



+ E / \zs- zi''^\\h^{xs,ils)\ds + E |zW||/i^(x,,/t,)-/i^(i,,u('=))| 
Jt Jt 

/■^ 

+ E 



where the first and fourth term converges to since S {k)dt (j,Adu)dt P-a.s. in 

£([0,T] X U). Similarly, the third term converges to since Ej^^'') — z\y^^ — > 
for any p > I. The last term converges to since hx is Lipschitz continuous 
and EI.t'^'^) — 0. Inserting this expression into the above, and since ip 

is bounded by some constant C, choose a such that CKjo? < 1, it follows by 
applying Gronwall's inequality that 

Ep2 \qt\''ds <Ke(^zt9.{xt) - zPgx{xP)f. 

Observing that ¥j{^gx{xT) — gx{xx^))'^ ^ by the Bounded Convergence Theo- 
rem as well as 

E|^(*=) - 0, the result follows. 

{a) is proven with the same arguments. 

Now proving {iii) is straightforward by making use of (i), {ii) and the Chattering 
Lemma. ■ 
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